cd/entity/Olmo 3Β· homeβ€Ί entitiesβ€Ί Olmo 3
grep -l @olmo 3 /news/*.json | wc -l β†’ 1

Olmo 3

mentions 1 type Person feed RSS

// recent coverage 1 mentions

19:45
2026-06-14
lesswrong.com
ai-safety

Why Do Naive SFT Filters For Safety Properties Fail?

Google DeepMind researchers investigate why filtering supervised fine-tuning (SFT) data fails to remove safety-relevant properties from language models, proposing a method to identify the source of th…

// co-occurs with top 3 entities